Bayesian Analysis of Online Newspaper Log Data

نویسندگان

  • Hannes Wettig
  • Jussi Lahtinen
  • Tuomas Lepola
  • Petri Myllymäki
  • Henry Tirri
چکیده

In this paper we address the problem of analyzing web log data collected at a typical online newspaper site. We propose a two-way clustering technique based on probability theory. On one hand the suggested method clusters the readers of the online newspaper into user groups of similar browsing behaviour, where the clusters are determined solely based on the click streams collected. On the other hand, the articles of the newspaper are clustered based on the reading behaviour of the users. The two-way clustering produces statistical user and page profiles that can be analyzed by domain experts for content personalization. In addition, the produced model can also be used for on-line prediction so that given the user cluster of a person entering the site, and the page cluster of an article of a newspaper, one can infer whether or not the user will have a look at the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discoursal Analysis of Rhetorical Structure of an Online Iraqi English Newspaper

Abstract Rhetorical structure is helpful in improving how the writers maintain cohesion in their writings. This study examines how the Iraqi writers maintain cohesion in the text by analyzing the various rhetorical moves in Azzaman, an online Iraqi newspaper. To this purpose, twelve opinion articles from Azzaman Iraqi newspaper, published from January 2013 to June 2013 were analyzed. The findin...

متن کامل

Discoursal Analysis of Rhetorical Structure of an Online Iraqi English Newspaper

Abstract Rhetorical structure is helpful in improving how the writers maintain cohesion in their writings. This study examines how the Iraqi writers maintain cohesion in the text by analyzing the various rhetorical moves in Azzaman, an online Iraqi newspaper. To this purpose, twelve opinion articles from Azzaman Iraqi newspaper, published from January 2013 to June 2013 were analyzed. The findin...

متن کامل

Bayesian paradigm for analysing count data in longitudina studies using Poisson-generalized log-gamma model

In analyzing longitudinal data with counted responses, normal distribution is usually used for distribution of the random efffects. However, in some applications random effects may not be normally distributed. Misspecification of this distribution may cause reduction of efficiency of estimators. In this paper, a generalized log-gamma distribution is used for the random effects which includes th...

متن کامل

Thematic Progression in the Rhetorical Sections of an Online Iraqi English Newspaper

Abstract Thematic development refers to the way theme and rheme in the clause are developed. The theory of rhetorical structure can be defined as the strategies that follow specific ways to make writing more persuasive. The present study aimed to examine how Iraqi writers maintain cohesion in the text by analyzing the patterns of thematic progression in various rhetorical sections in an online ...

متن کامل

Bayesian Analysis of Censored Spatial Data Based on a Non-Gaussian Model

Abstract: In this paper, we suggest using a skew Gaussian-log Gaussian model for the analysis of spatial censored data from a Bayesian point of view. This approach furnishes an extension of the skew log Gaussian model to accommodate to both skewness and heavy tails and also censored data. All of the characteristics mentioned are three pervasive features of spatial data. We utilize data augme...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003